Abstract: Eukaryotic cells must accurately and efficiently duplicate their genomes during 
each round of the cell cycle. Multiple linear chromosomes, an abundance of regulatory 
elements, and chromosome packaging are all challenges that the eukaryotic DNA 
replication machinery must successfully overcome. The replication machinery, the 
"replisome" complex, is composed of many specialized proteins with functions in 
supporting replication by DNA polymerases. Efficient replisome progression relies on tight 
coordination between the various factors of the replisome. Further, replisome progression 
must occur on less than ideal templates at various genomic loci. Here, we describe the 
functions of the major replisome components, as well as some of the obstacles to efficient 
DNA replication that the replisome confronts. Together, this review summarizes current 
understanding of the vastly complicated task of replicating eukaryotic DNA. 

Keywords: DNA replication; replisome; replication fork; genome stability; checkpoint; 
fork barriers; difficult-to-replicate sites 



1. Introduction: Basics of DNA Replication 

DNA replication, at its most fundamental, is the action of DNA polymerases synthesizing a DNA 
strand complementary to the original template strand. To synthesize DNA, the double-stranded DNA is 
unwound by DNA helicases ahead of polymerases, forming a replication fork containing two 
single-stranded templates. Replication processes permit the copying of a single DNA double helix into 
two DNA helices, which are divided into the daughter cells at mitosis. The major enzymatic functions 
carried out at the replication fork are well conserved from prokaryotes to eukaryotes. In practice, the 
replication machinery is a massive complex coordinating many proteins all working at the site of 
replication, forming the "replisome". 

The replisome is responsible for copying the entirety of genomic DNA in each proliferative cell. 
This process allows for the high-fidelity passage of hereditary/genetic information from parental cell to 
daughter cell and is thus essential to all organisms. Much of the cell cycle is built around ensuring that 
DNA replication occurs without errors. DNA replication is an energetically costly process. In G1 phase 
of the cell cycle, many of the DNA replication regulatory processes are initiated. In eukaryotes, the 
vast majority of DNA synthesis occurs during S phase of the cell cycle, and the entire genome must be 
unwound and duplicated to form two daughter copies. During G2, any damaged DNA or replication 
errors are corrected. Finally, one copy of the genome is segregated to each daughter cell at mitosis, or 
M phase. These daughter copies each contain one strand from the parental duplex DNA and one 
nascent antiparallel strand. This mechanism is conserved from prokaryotes to eukaryotes and is known 
as semiconservative DNA replication [1]. The process of semiconservative replication suggested a 
geometry for the site of DNA replication, a fork-like DNA structure, where the DNA helix is open, or 
unwound, exposing unpaired DNA nucleotides for recognition and base pairing for the incorporation 
of free nucleotides into double-stranded DNA (Figure 1). 
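
The strand bookkeeping behind semiconservative replication can be illustrated with a minimal Python sketch. The labels "parental" and "nascent" and the function names are illustrative assumptions, not established terminology from any software package. Each duplex is modeled as a pair of strands, and every round of replication pairs each existing strand with one newly made strand.

```python
from typing import List, Tuple

# A duplex is a pair of strand labels. The labels are illustrative only:
# "parental" marks a strand of the original duplex, "nascent" marks any
# strand synthesized during the simulation.
Duplex = Tuple[str, str]


def replicate(duplexes: List[Duplex]) -> List[Duplex]:
    """Perform one round of semiconservative replication.

    Each duplex is unwound and each of its two strands templates one
    newly synthesized partner, so every duplex yields two daughters.
    """
    daughters: List[Duplex] = []
    for strand_a, strand_b in duplexes:
        daughters.append((strand_a, "nascent"))
        daughters.append((strand_b, "nascent"))
    return daughters


def count_with_parental(duplexes: List[Duplex]) -> int:
    """Count duplexes that still carry one of the original strands."""
    return sum("parental" in duplex for duplex in duplexes)
```

Starting from a single duplex `("parental", "parental")`, the number of duplexes doubles with each call to `replicate`, while the number of duplexes carrying an original strand stays at two.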

Figure 1. The replication fork. Leading-strand synthesis proceeds continuously in the 5' to 
3' direction. Lagging-strand synthesis also occurs in the 5' to 3' direction, but in a 
discontinuous manner. An RNA/DNA primer (labeled in green) initiates leading-strand 
synthesis and every Okazaki fragment on the lagging strand. 



The base pairing and chain formation reactions, which form the daughter helix, are catalyzed by 
DNA polymerases [2,3]. These enzymes move along single-stranded DNA (ssDNA) and allow for the 
extension of the nascent DNA strand by "reading" the template strand and allowing for incorporation 
of the proper purine (adenine and guanine) and pyrimidine (thymine and cytosine) nucleobases. 
Activated free deoxyribonucleotides exist in the cell as deoxyribonucleotide triphosphates (dNTPs). 
These free nucleotides are added to an exposed 3'-hydroxyl group on the last incorporated nucleotide. 
In this reaction, a pyrophosphate is released from the free dNTP, generating energy for the 
polymerization reaction and exposing the 5' monophosphate, which is then covalently bonded to the 3' 
oxygen. Additionally, incorrectly inserted nucleotides can be removed and replaced by the correct 
nucleotides in an energetically favorable reaction. This property is vital to proper proofreading and 
repair of errors that occur during DNA replication. 
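
The template "reading" step described above can be sketched in a few lines of Python. This is a simplified illustration that assumes an undamaged template containing only the four standard bases; the function name and the pairing table are hypothetical, and proofreading is not modeled.

```python
# Watson-Crick pairing rules for the four DNA bases.
PAIRING = {"A": "T", "T": "A", "G": "C", "C": "G"}


def synthesize_nascent_strand(template_5_to_3: str) -> str:
    """Return the nascent strand, written 5' to 3', for a DNA template.

    The polymerase reads the template from its 3' end toward its 5' end
    and adds each complementary nucleotide to the 3' end of the growing
    strand. Written 5' to 3', the product is therefore the reverse
    complement of the template.
    """
    nascent = []
    for base in reversed(template_5_to_3.upper()):  # read template 3' to 5'
        nascent.append(PAIRING[base])  # incorporate the pairing nucleotide
    return "".join(nascent)
```

For the made-up template `ATGCC`, the function returns `GGCAT`; applying it to that product regenerates the original template, which reflects the complementarity of the two strands.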

Owing to the antiparallel nature of duplex DNA, DNA replication occurs in opposite directions 
between the two new strands at the replication fork. However, all DNA polymerases synthesize DNA 
in the 5' to 3' direction. Consequently, further coordination is required during DNA replication. Two 
replicative polymerases synthesize DNA in opposing orientations (Figure 1). Polymerase ε (epsilon) 
synthesizes DNA in a continuous fashion, as it is "pointed" in the same direction as DNA unwinding. 
This strand is known as the "leading strand." In contrast, polymerase δ (delta) synthesizes DNA on the 
opposite template strand in a fragmented, or discontinuous, manner and this strand is termed the 
"lagging strand" [4]. The discontinuous stretches of DNA replication products on the lagging strand 
are known as Okazaki fragments and are about 100 to 200 bases in length at eukaryotic replication 
forks. Owing to its "lagging" nature, the lagging strand generally contains a longer stretch of ssDNA 
that is coated by single-stranded DNA-binding proteins, which stabilize ssDNA templates by preventing 
secondary structure formation or other transactions at the exposed ssDNA. In eukaryotes, ssDNA 
stabilization is maintained by the heterotrimeric complex known as replication protein A (RPA) 
(Figure 2) [5,6]. Each Okazaki fragment is preceded by an RNA primer, which is displaced by the 
procession of the next Okazaki fragment during synthesis. In eukaryotic cells, a small amount of the 
DNA segment immediately upstream of the RNA primer is also displaced, creating a flap structure. 
This flap is then cleaved by endonucleases (such as Fen1, discussed later). At the replication fork, the 
gap in DNA after removal of the flap is sealed by DNA ligase I [7-9]. Owing to the relatively short 
nature of the eukaryotic Okazaki fragment, DNA replication synthesis occurring discontinuously on 
the lagging strand is less efficient and more time consuming than leading-strand synthesis. 
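
To give a sense of scale for the repeated initiation events on the lagging strand, the following Python sketch counts the fragments needed to cover a template of a given length. The 100 to 200 base fragment length comes from the text above; the template length used in the note below is a hypothetical round number chosen only for illustration, and the function name is likewise an assumption.

```python
def okazaki_fragment_count(template_length_nt: int, fragment_length_nt: int) -> int:
    """Number of Okazaki fragments needed to cover a lagging-strand template.

    Each fragment requires its own priming event, so this is also the
    number of primers. Integer ceiling division is used so that a final
    partial fragment is counted.
    """
    if template_length_nt <= 0 or fragment_length_nt <= 0:
        raise ValueError("lengths must be positive")
    return (template_length_nt + fragment_length_nt - 1) // fragment_length_nt
```

For a hypothetical 1,000,000-nucleotide lagging-strand template, this gives 10,000 fragments at 100 nucleotides each and 5,000 fragments at 200 nucleotides each, whereas the corresponding leading strand is primed once at the origin.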

Figure 2. The eukaryotic replisome complex coordinates DNA replication. Replication on 
the leading and lagging strands is performed by Pol ε and Pol δ, respectively. Many 
replisome factors (including the FPC [fork protection complex], Claspin, And-1, and RFC 
[the replication factor C clamp loader]) are charged with regulating polymerase functions 
and coordinating DNA synthesis with unwinding of the template strand by Cdc45-MCM 
[mini-chromosome maintenance] -GINS [go-ichi-ni-san]. The replisome also associates 
with checkpoint proteins as DNA replication and genome integrity surveillance mechanisms. 




2. DNA Polymerases and Helicases 

The two basic processes of DNA replication are unwinding of the template strand and 
polymerization of the daughter strands. Therefore, there are two main "workhorse" enzymes in the 
replisome, the replicative helicase and polymerases. The replicative helicase is responsible for 
unwinding the parental duplex DNA, exposing two ssDNA templates. The template is then utilized by 
the replicative polymerases to generate two copies of the parental genome. Recent results have yielded 
new information on the specific role of each polymerase at the replication fork while building on 
our knowledge of the replicative helicase. Understanding the main players in DNA replication allows 
us to further appreciate their regulation by other replisome factors. 

2.1. Replicative DNA Polymerases: Further Subdivision of Labor 

DNA polymerase function is highly specialized. From viruses through mammals, defined DNA 
polymerases accomplish replication on specific templates and in narrow localizations. At the 
eukaryotic replication fork, three distinct replicative polymerase complexes contribute to canonical 
DNA replication: α, δ, and ε. These three polymerases are essential for viability of the cell [10-13]. 
Because DNA polymerases require a primer on which to begin DNA synthesis, first, polymerase α 
(Pol α) acts as a replicative primase. Pol α is associated with an RNA primase and this complex 
accomplishes the priming task by synthesizing a primer that contains a short ~10-nucleotide RNA 
stretch followed by 10 to 20 DNA bases [14,15]. Importantly, this priming action occurs at replication 
initiation at origins to begin leading-strand synthesis and also at the 5' end of each Okazaki fragment 
on the lagging strand (Figure 1). 
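
The primer dimensions given above can be combined with the Okazaki fragment length quoted in the Introduction (about 100 to 200 bases) to estimate how much of each fragment is initially laid down by the Pol α-primase complex. The sketch below is a back-of-the-envelope calculation in Python. It assumes that the quoted fragment length includes the primer and it ignores the later removal of the primer during fragment maturation; the constant and function names are illustrative.

```python
# Values taken from the text; all lengths are in nucleotides.
RNA_PRIMER_NT = 10              # approximate RNA stretch made by primase
DNA_EXTENSION_NT = (10, 20)     # DNA bases added by Pol alpha
OKAZAKI_LENGTH_NT = (100, 200)  # eukaryotic Okazaki fragment length


def primer_length_range():
    """Shortest and longest RNA/DNA primer implied by the values above."""
    return (
        RNA_PRIMER_NT + DNA_EXTENSION_NT[0],
        RNA_PRIMER_NT + DNA_EXTENSION_NT[1],
    )


def primer_fraction_range():
    """Smallest and largest share of a fragment made by Pol alpha-primase.

    The minimum pairs the shortest primer with the longest fragment;
    the maximum pairs the longest primer with the shortest fragment.
    """
    shortest, longest = primer_length_range()
    return (shortest / OKAZAKI_LENGTH_NT[1], longest / OKAZAKI_LENGTH_NT[0])
```

Under these assumptions the primer spans 20 to 30 nucleotides, or roughly 10% to 30% of each newly initiated Okazaki fragment, with the remainder synthesized after the polymerase switch.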

However, Pol α is not able to continue DNA replication. From in vitro studies using SV40 
T-antigen-dependent replication and reconstituted components, it was observed that DNA replication 
must be "handed off" to another polymerase to continue synthesis [9,16]. The polymerase switching 
requires clamp loaders (which will be discussed in detail later in this article) [17]. Initially, it was 
thought that Pol δ performed leading-strand replication and that Pol α completed each Okazaki 
fragment on the lagging strand [17,18]. Using mutator polymerase variants and mapping nucleotide 
misincorporation events, Kunkel and colleagues found that Pol ε and Pol δ mutations lead to 
mismatched nucleotide incorporation only on the leading and lagging strands, respectively [19-21]. 
Thus, normal DNA replication requires the coordinated actions of three DNA polymerases: Pol α for 
primer synthesis, Pol ε for leading-strand replication, and the constantly loaded Pol δ for generating 
Okazaki fragments during lagging-strand synthesis (Figure 2). 

2.2. Helicases Unwind DNA for Replication 

For DNA polymerases to function, the double-stranded helix must be unwound to expose a 
single-stranded template. This activity is performed by the replicative helicase. In eukaryotes, the 
replicative helicase is a hexameric complex composed of the mini-chromosome maintenance proteins 
(Mcm2-7: Mcm2, Mcm3, Mcm4, Mcm5, Mcm6 and Mcm7). The MCM helicase is an AAA+ ATPase, 
a superfamily of protein complexes that process substrates through a central pore using energy release 
from ATP hydrolysis [22]. 
MCM activity is required throughout S phase for DNA replication [23,24]. The MCM proteins are 
recruited to replication origins (during G1 phase and before DNA replication) then redistributed 
throughout genomic DNA during S phase, indicative of their localization to the replication fork [25]. 
Although it was known that MCM proteins are required for DNA replication initiation and progression, 
it was not originally clear what the enzymatic function of the MCM complex could be [26]. In a study 
using purified MCM homologue from archaea, ATP-driven helicase activity was detected in fractions 
corresponding to double hexamer forms of the complex [27]. Further, purified complexes of Mcm4/6/7 
have ATP-dependent helicase activity in vitro, unwinding DNA in the 3' to 5' direction [28,29]. These 
results, coupled with localization and knockout studies, strongly favor the model suggesting that the 
Mcm2-7 hexamer is the core of the replicative helicase. However, the full replicative helicase 
responsible for replication in eukaryotic cells requires additional factors, including the go-ichi-ni-san 
(GINS) complex and Cdc45, which together form the Cdc45-MCM-GINS (CMG) complex (Figure 2). 
The function of the CMG complex will be discussed in detail later in this article. 

3. Control of DNA Replication at the Replication Fork 

Although the enzymatic processes of DNA replication such as unwinding, template generation/ 
stabilization, and daughter strand synthesis are largely determined, the control of DNA replication 
in vivo is not. DNA replication requires multiple processes to coordinate and regulate highly accurate 
and timely duplication of genomic DNA during S phase. In addition to primase, replicative 
polymerases, and helicases, the DNA replication fork requires the use of accessory proteins to 
facilitate efficient initiation and replication fork progression. The cooperative protein complexes that 
participate in DNA replication are known as the replisome (Figure 2). New findings continue to 
suggest that the size and complexity of the replisome are greater than once thought. 

3.1. Replication Initiation at Origins 

To completely duplicate the genome in a reasonable time during the cell cycle, eukaryotic cells 
initiate DNA replication at multiple sites, whereas prokaryotic replication 
initiates at a single locus. Replication initiator sites are known as origins of replication (Oris) and are 
recognized by the origin recognition complex (ORC) of proteins in eukaryotic cells. ORCs are found 
associated with DNA throughout the genome and form the markers to which replication forks are 
recruited in a highly regulated manner (reviewed in [30] and [31]). In some eukaryotes, such as 
budding yeast, origins are defined by conserved nucleotide sequences, known as autonomously 
replicating sequences (ARSs), that mark Oris. However, in most other model eukaryotes and in all 
metazoans, replication origins are less well defined (reviewed in [32]). Origin usage in metazoans can 
be dynamic, with origin firing at different sites depending on cell type and developmental stage. 
Nevertheless, the mechanism of replisome assembly and origin firing is highly conserved. 

During late mitosis and G1 phase, Cdt1 and Cdc6 (Cdc18 in fission yeast) proteins associate with 
Ori sites throughout the genome and recruit Mcm2-7 (Figure 3A) [25,33-37]. At this time, double 
hexamers of the Mcm2-7 complex are loaded at replication origins [38,39]. This generates a 
prereplication complex (pre-RC). Origins with an associated pre-RC are considered licensed for 
replication. Licensed replication origins can then be "fired," when replication actually initiates at Oris. 
Origin firing is brought about by multiple phosphorylation events carried out by cyclin E-CDK2 at the 
onset of S phase and by Cdc7-Dbf4 (DDK kinase or Dbf4-dependent kinase) prior to individual origin 
firing (Figure 3B) [40-44]. At this point, origin melting occurs and DNA unwinding by the helicase 
generates ssDNA, exposing a template for replication (Figure 3C). The replisome then begins to form 
with the localization of replisome factors such as Cdc45 [45,46]. Interestingly, the assembly of the 
CMG complex at origins requires the non-catalytic domain of Pol ε, suggesting a codependence 
between the replicative helicase and polymerase during replisome assembly [47,48]. Further 
recruitment assembles the remainder of replisome proteins. DNA synthesis begins on the melted 
template, and the replication machinery translocates away from the origin in a bidirectional manner. 

Figure 3. Mcm2-7 loads onto DNA at replication origins during G1 and unwinds DNA 
ahead of replicative polymerases. (A) The combined activities of Cdc6 and Cdt1 bring 
MCM complexes to replication origins. (B) CDK/DDK-dependent phosphorylation of pre-RC 
components leads to replisome assembly and origin firing. Cdc6 and Cdt1 are no longer 
required and are removed from the nucleus or degraded. (C) MCMs and associated proteins 
(GINS and Cdc45 are shown) unwind DNA to expose template DNA. At this point replisome 
assembly can be completed and replication initiated. "P" indicates phosphorylation. 




To prevent multiple rounds of DNA replication during a single cell cycle, mechanisms have 
evolved to regulate limiting initiation factors: ORC, Cdt1, and Cdc6 (reviewed in [49]). After origin 
firing, Orc1, the largest ORC subunit, is ubiquitinated and degraded [50]. Phosphorylation of human 
Orc2 leads to the dissociation of Orc2-6 from chromatin to further prohibit re-replication, while the 
yeast complex appears to remain on DNA throughout the cell cycle [51-53]. Once phosphorylated at 
the onset of S phase, Cdc6 is rapidly exported from the nucleus [54]. Geminin is a specific inhibitor of 
Cdt1. Geminin protein levels increase throughout S phase, blocking Cdt1 from localizing to replication 
origins [55]. It is also well understood that Cdt1 and Cdc6 undergo rapid proteasome-dependent 
degradation (reviewed in [56,57]). Thus, duplicating the genome once and only once per cell cycle is 
controlled at the level of licensing. These multiple and redundant mechanisms prevent origins from being 
relicensed, restricting replication licensing to once per cell cycle. These controls emphasize the importance of 
preventing overreplication of chromosomal DNA, in order to preserve genomic integrity. 
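
The once-per-cycle logic of licensing and firing can be pictured as a small state machine. The Python sketch below is a toy model under strong simplifying assumptions: licensing is allowed only in a phase labeled "G1" (standing in for late mitosis and G1, when Cdt1 and Cdc6 are available), firing is allowed only in "S", and the many individual controls described above are collapsed into a single rule. The class, method, and state names are hypothetical.

```python
from enum import Enum


class OriginState(Enum):
    UNLICENSED = "unlicensed"
    LICENSED = "licensed"  # pre-RC assembled, Mcm2-7 loaded
    FIRED = "fired"        # replication has initiated from this origin


class Origin:
    """Toy model of a single replication origin across one cell cycle."""

    def __init__(self) -> None:
        self.state = OriginState.UNLICENSED

    def license(self, phase: str) -> bool:
        """Assemble a pre-RC. Only an unlicensed origin in G1 qualifies."""
        if phase == "G1" and self.state is OriginState.UNLICENSED:
            self.state = OriginState.LICENSED
            return True
        return False

    def fire(self, phase: str) -> bool:
        """Initiate replication. Only a licensed origin in S phase qualifies."""
        if phase == "S" and self.state is OriginState.LICENSED:
            self.state = OriginState.FIRED
            return True
        return False

    def reset_for_next_cycle(self) -> None:
        """Return to the unlicensed state once the cell cycle is complete."""
        self.state = OriginState.UNLICENSED
```

In this model an origin that has fired cannot be licensed or fired again until `reset_for_next_cycle` is called, which mirrors the requirement that licensing be restricted to once per cell cycle.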

3.2. The DNA Sliding Clamp: PCNA 

DNA polymerases require additional factors to support DNA replication in vivo. DNA polymerases 
have a semiclosed hand structure, which allows them to load onto DNA and translocate. This structure 
permits DNA polymerase to hold the single-stranded template, incorporate dNTPs at the active site, 
and release the newly formed double strand. However, the conformation of DNA polymerases does not 
allow for their stable interaction with the template DNA. To strengthen the interaction between 
template and polymerase, DNA sliding clamps have evolved, promoting the processivity of replicative 
polymerases. In eukaryotes, this sliding clamp is a homotrimer known as proliferating cell nuclear 
antigen (PCNA), which forms a ring structure. The PCNA ring has polarity with a surface that interacts 
with DNA polymerases and tethers them securely to DNA. PCNA-dependent stabilization of DNA 
polymerases has a significant effect on DNA replication because it enhances polymerase processivity 
up to 1,000-fold [58,59]. Such stimulation of replication activity makes PCNA an essential cofactor 
in vivo. PCNA also has the distinction of being one of the most common interaction platforms in the 
replisome to accommodate multiple processes at the replication fork [60]. Therefore, PCNA is often 
viewed as a regulatory cofactor for DNA polymerases (Figure 4). 

Figure 4. PCNA is a node of protein-protein interaction in the replisome. The PCNA 
homotrimer interacts with proteins in diverse processes. Many of the protein-protein 
interactions occur through a PCNA binding motif known as the PIP box, which binds a 
central domain on PCNA (orange). 




Various PCNA modifications regulate the replisome under specific circumstances during DNA 
replication. The modifications of PCNA have dramatic effects on its function. Although there are some 
species-specific modifications of PCNA throughout eukaryota, the principles remain conserved. Upon 
DNA damage, PCNA is monoubiquitinated, which changes PCNA's affinity from replicative 
polymerases to the damage-tolerant translesion synthesis (TLS) polymerases [61,62]. PCNA 



Genes 2013, 4 



ubiquitination is dependent on the DNA damage checkpoint pathway and regulates dynamic changes 
in the replication fork. This process allows for bypass of bulky DNA damage that would otherwise 
prevent replication fork progression, although this method of damage bypass is error prone [62,63]. In 
contrast, polyubiquitination of the same site directs the cell towards DNA damage bypass by poorly 
characterized, but essentially error-free mechanisms [64-66]. PCNA can also be SUMOylated (small 
ubiquitin-like modifier) at the same site in yeast, and SUMOylated PCNA exists in vertebrates [67,68]. 
This modification is thought to suppress ubiquitination of PCNA, therefore inhibiting TLS and other 
DNA repair pathways, which are potentially harmful to the cell because they can introduce mutations 
and genome rearrangements [69-71]. In addition, acetylation of PCNA may play a role in enhancing 
the processivity of associated polymerases [72]. These findings indicate that PCNA modifications play 
critical roles in controlling pathway selection for DNA damage management during DNA replication. 

3.3. Clamp Loaders 

PCNA fully encircles DNA and must be loaded onto DNA at the replication fork. At the leading 
strand, this is a relatively infrequent process, because DNA replication is essentially continuous until 
replication is terminated. However, at the lagging strand, Pol δ is continually loaded at the start of each 
Okazaki fragment. This constant initiation of Okazaki fragment synthesis requires repeated PCNA 
loading for efficient DNA replication. PCNA loading is accomplished by the replication factor C 
(RFC) complex, which is composed of five AAA+ ATPases [73]. RFC recognizes primer-template 
junctions and loads PCNA at these sites [74,75]. The PCNA homotrimer is opened by RFC using 
energy from ATP hydrolysis and is then loaded onto DNA in the proper orientation to facilitate its 
association with the polymerase [76,77]. Clamp loaders can also unload PCNA from DNA, a 
mechanism that becomes important when replication must be terminated [77]. 

Eukaryotes have evolved multiple clamp loading complexes, each of which appears to function in a 
separate pathway. The canonical clamp loader essential for DNA replication is RFC and includes Rfc1, 
Rfc2, Rfc3, Rfc4 and Rfc5. At least three RFC-like complexes exist in eukaryotic cells. RFC-Ctf18, 
which contains Ctf18 in place of Rfc1, promotes sister chromatid cohesion and regulates replication 
speed [78-81]. RFC-Elg1, which contains Elg1, is thought to unload SUMOylated PCNA in the presence 
of DNA damage to allow for replication progression through damaged DNA templates [71]. The 
RFC-Rad17/Rad24 clamp loader does not load PCNA, but loads the 9-1-1 complex at DNA damage sites during 
the replication checkpoint response [82]. Thus, DNA replication can be regulated at the level of PCNA 
clamp loading, in order to accommodate multiple processes that take place during DNA replication 
(Figure 4). 

3.4. Okazaki Fragment Maturation 

DNA replication on the lagging strand is discontinuous. At the end of Okazaki fragment synthesis, 
Pol δ runs into the previous Okazaki fragment and displaces its 5' end containing the RNA primer and 
a small segment of DNA. This generates an RNA-DNA single strand flap, which must be cleaved, and 
the nick between the two Okazaki fragments must be sealed by DNA ligase I. This process is known as 
Okazaki fragment maturation and can be handled in two ways: one mechanism is designed to process 
short flaps, while the other deals with long flaps [9]. Pol δ is able to displace up to 2 to 3 nucleotides of 
DNA or RNA ahead of its polymerization, generating a short "flap" substrate for Fen1, which can 
remove nucleotides from the flap, about one nucleotide at a time. By repeating cycles of this process, 
Pol δ and Fen1 can coordinate the removal of RNA primers and leave a DNA nick at the lagging 
strand [83]. It has been proposed that this iterative process is preferable to the cell because it is tightly 
regulated and does not generate large flaps that need to be excised [84]. In the event of deregulated 
Fen1/Pol δ activity, the cell uses an alternative mechanism to generate and process long flaps by using 
Dna2, which has both helicase and nuclease activities [85]. Dna2 is localized to Okazaki fragments 
[86,87], and importantly, Dna2 is thought to act on Okazaki fragment ends that are not optimal for 
Fen1 activity [88]. For example, DNA flaps containing secondary structure and long flaps are both 
poor Fen1 substrates. The nuclease activity of Dna2 is required for removing these long flaps, leaving 
a shorter flap to be processed by Fen1. Electron microscopy studies indicate that nucleosome loading 
on the lagging strand occurs very close to the site of synthesis [89]. Thus, Okazaki fragment 
maturation is an efficient process that occurs immediately after the nascent DNA is synthesized. These 
redundant and proficient mechanisms showcase the importance of Okazaki fragment maturation in 
protecting genomic integrity. 

3.5. Linking Helicase and Polymerase Functions 

The DNA helicases and polymerases must remain in close contact at the replication fork (Figure 2). 
If unwinding occurs too far in advance of synthesis, large tracts of ssDNA are exposed. This can 
activate DNA damage signaling or induce aberrant DNA repair processes. To thwart these problems, 
the eukaryotic replisome contains specialized proteins that are designed to regulate the helicase activity 
ahead of the replication fork. These proteins also provide docking sites for physical interaction between 
helicases and polymerases, thereby ensuring that duplex unwinding is coupled with DNA synthesis. 

There is evidence that Cdc45 has such a role. It has been shown that Cdc45 coordinates MCM 
progression with the replisome [90]. Before DNA replication initiates, S-phase kinase DDK 
phosphorylates Mcm4, which allows stable Cdc45-MCM complex formation. This is an important step 
in replisome assembly at origins and is essential for origin firing [91]. Furthermore, during S phase, 
Cdc45 depletion prevents DNA replication stress signaling through downregulation of active DNA 
synthesis, reinforcing the notion that Cdc45 is an important regulatory protein, required for DNA 
replication [92]. 

During the past decade, a protein complex termed GINS has drawn much attention. The GINS 
complex is composed of the Sld5 (go), Psf1 (ichi), Psf2 (ni), and Psf3 (san) proteins (named for 5, 
1, 2, 3 in Japanese) and was originally identified in budding yeast and in Xenopus laevis extracts [93,94]. 
GINS has been shown to regulate the interaction of the MCM subunits with Cdc45 [95]. The 
high-molecular-weight complex consisting of the MCM, GINS, and Cdc45 is now known as the CMG 
complex and can be observed in human cells [96,97]. The GINS complex may provide additional 
regulation of the MCM helicase during replication progression and in response to DNA replication 
stress. Importantly, the full CMG complex is required for DNA unwinding, demonstrating that the 
complex of Cdc45-MCM-GINS is the functional DNA helicase machinery in eukaryotic cells 
(Figure 2) [98,99]. It is also worth noting that the Psf3 subunit is phosphorylated by DNA damage 
signaling kinases ATM and ATR, suggesting a possible mechanism of MCM regulation upon DNA 
damage during DNA replication [100]. 

While the identification of the CMG complex adds to our understanding of the regulation of the 
MCM helicase activity, how the CMG complex interacts with the bulk of the replisome is still 
emerging. Recent studies have shown that the Ctf4/And-1 protein interacts with both the CMG complex 
and Pol α [101,102]. Ctf4 is a Pol α accessory factor, which is required for the recruitment of Pol α to 
replication origins [103]. Indeed, human Ctf4/And-1 stimulates Pol α function in vitro, whereas 
depletion of the protein greatly reduces replication fork progression rates in human cells [101]. 
Therefore, the interaction of CMG and Ctf4 suggests the critical role of CMG in tethering Pol α to the 
replication fork. Considering that Pol α is continually required on the lagging strand and that CMG has 
helicase activity, CMG may couple DNA unwinding with primer synthesis on the lagging strand. Such 
a coupling mechanism may help to minimize the length of the exposed ssDNA tract at the replication 
fork, thereby promoting efficient DNA replication and ensuring genomic integrity. 

How, then, is leading-strand synthesis coupled with unwinding activity? Mrc1/Claspin appears to be 
the best candidate for this task. In budding yeast, Mrc1 interacts with Pol ε as well as MCM proteins in 
unperturbed S phase and under genotoxic stress [104]. The importance of this direct link between the 
helicase and the leading-strand polymerase is underscored by results in cultured human cells, where 
Mrc1/Claspin is required for efficient replication fork progression [105]. These results suggest that 
efficient DNA replication also requires the coupling of helicases and leading-strand synthesis. 

Taken together, it appears that the CMG complex has a central role in coupling template unwinding 
with DNA polymerases. Importantly, CMG interacts with both leading- and lagging-strand polymerases 
via Mrc1/Claspin and Ctf4/And-1, respectively (Figure 2). This setup also helps coordinate leading- 
and lagging-strand synthesis, integrating required processes at the replication fork into a centralized 
replisome. In this model, protein-protein interactions allow for crosstalk among multiple enzymatic 
processes essential for DNA replication. For instance, leading-strand synthesis can be restrained to 
closely match the rate of the lagging-strand synthesis. Considering that replication efficiency is 
reduced when Ctf4/And-1 or Mrc1/Claspin is lost, the coordination of multiple activities plays a 
critical role in replisome progression. 

Additional factors are required for maintaining proper replisome structure and function. One such 
factor is the fork protection complex (FPC), which is composed of Timeless and Tipin (Tof1-Csm3 in 
budding yeast and Swi1-Swi3 in fission yeast). FPC is a replisome component and appears to play a 
budding yeast and Swil-Swi3 in fission yeast). FPC is a replisome component and appears to play a 
key role in efficient DNA replication (reviewed in [106]). As neither FPC protein appears to have any 
enzymatic function, it is likely that their role is to maintain or stabilize the replisome during DNA 
replication. Indeed, in the absence of functional FPC proteins, DNA replication takes longer to 
complete and results in shorter replication tracts, indicating that the FPC is required for efficient 
replisome progression [107,108]. In addition, the FPC has been shown to associate with nonreplisome 
proteins involved in multiple genome maintenance mechanisms, suggesting that the FPC mediates 
interactions between the replisome and other activities that take place on DNA during replication 
(reviewed in [106]). Given that loss of Claspin, Ctf4/And-1, and the FPC leads to inefficient 
replication, maintaining proper protein-protein interactions in the replisome is vital to coordinating the 
variety of processes occurring at the replication fork during DNA replication. 

3.6. Replication Checkpoint Proteins 

In order to preserve genetic information every time the cell divides, DNA replication must be 
completed with high fidelity. To achieve this task, eukaryotic cells are equipped with genome 
surveillance systems that detect errors or problems during DNA replication. Proteins required for this 
mechanism, termed the DNA replication checkpoint, have several mechanisms to preserve genomic 
integrity. To ensure that DNA is faithfully repaired and replicated, checkpoint proteins function to 
arrest the cell cycle to allow time for DNA repair and prevent cells from entering mitosis. Checkpoint 
proteins also directly facilitate some DNA repair pathways, while they stabilize replication fork 
structures to prevent further damage. These mechanisms are essential to avoid passing down mutations 
or other chromosome aberrations to offspring. 

The checkpoint proteins are well conserved throughout the domain Eukaryota. At the central node 
of the checkpoint networks are two phosphatidylinositol 3-kinase (PI3K)-like kinases, ATR and 
ATM. ATM and ATR share a consensus target phosphorylation sequence, the SQ/TQ motif. However, 
their targets and their roles in the cells differ substantially. ATR is an essential mammalian gene, 
whereas ATM is not [109,110]. This is thought to be due to the specificity of each kinase for different 
types of DNA damage. ATM arrests the cell cycle in response to DNA double strand breaks (DSBs), 
which are a relatively rare event in the cell. Conversely, ATR and its obligate checkpoint partner, 
ATR-interacting-protein (ATRIP), are responsive to stretches of ssDNA that are coated by RPA 
(Figure 2) [111,112]. Importantly, ssDNA accumulation occurs frequently during DNA replication. 
Long stretches of ssDNA are also present during replication stress. Therefore, ATR-ATRIP is heavily 
relied upon to arrest the cell cycle and to preserve genome integrity during DNA replication. Indeed, 
ATR is found on chromatin during S phase in a manner similar to RPA and the checkpoint mediator 
Claspin [113]. Moreover, studies in budding yeast demonstrated that Mec1 (the ATR homologue) is 
required for normal DNA synthesis during S phase and that mec1 deficiency causes a dramatically 
longer S phase [114]. These findings reinforce the importance of checkpoint proteins during S phase. 
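
As a small illustration of the SQ/TQ consensus mentioned above, the following Python sketch lists candidate ATM/ATR target positions in a protein sequence by scanning for a serine or threonine followed by a glutamine. The function name and the toy sequence are hypothetical, and a motif match alone does not establish that a site is phosphorylated in vivo.

```python
def find_sq_tq_sites(protein_seq):
    """Return (position, motif) pairs for every SQ or TQ dipeptide.

    Positions are 1-based and point at the S or T residue.
    """
    seq = protein_seq.upper()
    sites = []
    for i in range(len(seq) - 1):
        # A candidate site is S or T immediately followed by Q.
        if seq[i] in "ST" and seq[i + 1] == "Q":
            sites.append((i + 1, seq[i:i + 2]))
    return sites


# Toy sequence (hypothetical, not a real protein).
toy_seq = "MSQATQLSP"
print(find_sq_tq_sites(toy_seq))
```

A scan of this kind only narrows down candidates; which SQ/TQ sites are used by ATM or by ATR depends on the factors discussed in the text, such as the type of DNA lesion and the recruitment of each kinase.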

The generation of ssDNA tracts is important in initiating the checkpoint pathways downstream of 
replication-associated damage. Once sufficiently long, RPA-coated ssDNA stretches are able to recruit 
ATR-ATRIP [112,115]. However, to become fully active, the ATR kinase relies on sensor proteins 
that "sense" whether the checkpoint proteins are localized to a valid site of DNA replication stress. The 
9-1-1 (Rad9-Hus1-Rad1) heterotrimeric clamp and its clamp loader RFC-Rad17 recognize the damaged 
DNA. It has been demonstrated in vitro that RFC-Rad17 recognizes gapped or nicked DNA and loads the 
9-1-1 clamp onto DNA [82]. The presence of 9-1-1 on DNA is sufficient to facilitate the interaction 
between ATR-ATRIP and a group of proteins termed checkpoint mediators, such as TopBP1 and 
Mrc1/Claspin. TopBP1 interacts with (and recruits) the phosphorylated Rad9 component of 9-1-1 and 
binds ATR-ATRIP, which phosphorylates Chk1 [116-120]. Furthermore, Mrc1/Claspin is required for 
the complete activation of ATR-ATRIP that phosphorylates Chk1, the major downstream checkpoint 
effector kinase [121-123]. Claspin is a component of the replisome and contains a domain for docking 
with Chk1, revealing a specific function of Claspin during DNA replication: the promotion of 
checkpoint signaling at the replisome (Figure 2) [124]. 

Chk1 signaling is vital for arresting the cell cycle and preventing cells from entering mitosis with 
incomplete DNA replication or DNA damage. In metazoans, active Chk1 induces a robust G2/M 
checkpoint by phosphorylating Cdc25A protein, a phosphatase that removes inhibitory phosphorylations 
on Cdk1 and Cdk2. Once phosphorylated, Cdc25A is targeted for proteasome-dependent degradation 
by the ubiquitin ligase complex containing the F-box protein β-TrCP [125]. Thus, Chk1 maintains 
inhibitory phosphorylation of metazoan Cdk1 and Cdk2 to block mitotic progression and origin firing, 
respectively [126-128]. The Chk1-dependent Cdk inhibition may constitute the single most important 
function of the ATR-Chk1 checkpoint, to arrest the cell cycle and allow sufficient time for completion 
of DNA repair mechanisms, which in turn prevents the inheritance of damaged DNA. In addition, 
Chk1-dependent Cdk inhibition plays a critical role in inhibiting origin firing during S phase. This 
mechanism prevents continued DNA synthesis and is required for the protection of the genome in the 
presence of replication stress and potential genotoxic conditions [129-131]. Thus, ATR-Chk1 activity 
further prevents potential replication problems at the level of single replication origins by inhibiting 
initiation of replication throughout the genome, until the signaling cascade maintaining cell-cycle 
arrest is turned off. 

3.7. Replication through Nucleosomes 

Eukaryotic genomes are substantially more complicated than the smaller and unadorned prokaryotic 
genomes. Eukaryotic cells have multiple noncontiguous DNA components, chromosomes, each of 
which must be compacted to allow packaging within the confined space of a nucleus. Chromosomes 
are packaged by wrapping ~147 nucleotides (at intervals averaging 200 nucleotides) around an 
octamer of histone proteins, forming the nucleosome [132-135]. The histone octamer includes two 
copies each of histone H2A, H2B, H3, and H4. Histone proteins are subject to a variety of 
modifications, including phosphorylation, acetylation, methylation, and ubiquitination that represent 
vital epigenetic marks. The tight association of histone proteins with DNA in nucleosomes suggests 
that eukaryotic cells possess proteins that are designed to remodel histones ahead of the replication 
fork, in order to allow smooth progression of the replisome. It is also essential to reassemble histones 
behind the fork to reestablish the nucleosome conformation. Furthermore, it is important to transmit the 
epigenetic information found on the parental nucleosomes to the daughter nucleosomes, in order to 
preserve the same chromatin state. In other words, the same histone modifications should be present on 
the daughter nucleosomes as were on the parental nucleosomes. This must all be done while doubling 
the amount of chromatin, which requires incorporation of newly synthesized histone proteins. This 
process is accomplished by "histone chaperones" and "histone remodelers," which are discussed below 
(Figure 5). 
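
To give a rough sense of scale for the packaging figures quoted above, the following Python sketch estimates how many nucleosomes cover a stretch of DNA and how many newly synthesized histone molecules one round of replication would require. The 1 Mb region is a hypothetical example, and the calculation assumes, as a simplification, that all parental histones are recycled onto the daughter strands.

```python
WRAPPED_BP = 147           # DNA wrapped around one histone octamer (from the text)
REPEAT_BP = 200            # average spacing between nucleosomes (from the text)
HISTONES_PER_OCTAMER = 8   # two copies each of H2A, H2B, H3, and H4


def nucleosome_estimates(region_bp):
    """Back-of-the-envelope nucleosome numbers for a region of region_bp base pairs."""
    nucleosomes = region_bp // REPEAT_BP
    return {
        "nucleosomes": nucleosomes,
        # Share of the DNA that is wrapped rather than linker.
        "wrapped_fraction": WRAPPED_BP / REPEAT_BP,
        # Replication doubles the DNA, so one new octamer is needed per
        # parental nucleosome (assumes full recycling of parental histones).
        "new_histones_needed": nucleosomes * HISTONES_PER_OCTAMER,
    }


# Hypothetical 1 Mb region.
print(nucleosome_estimates(1_000_000))
```

Under these assumptions, roughly three quarters of the DNA is wrapped around histones at any time, which illustrates why the replisome needs dedicated factors to deal with nucleosomes.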

Several histone chaperones are known to be involved in replication-coupled nucleosome assembly. 
The FACT complex components were originally identified as proteins that greatly stimulate 
transcription by RNA polymerase II (reviewed in [136]). In budding yeast, FACT was found to interact 
with DNA Pol α-primase complex, and the FACT subunits were found to interact genetically with 
replication factors [137-139]. More recently, studies showed that FACT facilitates DNA replication 
in vivo and is associated with the replisome in budding yeast and human cells [95,140-142]. The 
FACT complex is a heterodimer that does not hydrolyze ATP, but facilitates the "loosening" of 
histones in nucleosomes [143]. How the FACT complex relieves the tight association of histones with 
DNA in nucleosomes is unanswered. Further, the mechanisms and factors facilitating nucleosome 
removal during DNA replication remain unknown. 

Figure 5. Nucleosome displacement and deposition during DNA replication. Histones are 
removed from chromatin ahead of the replication fork. FACT may facilitate this process. 
Asf1 recruits histone H3-H4 dimers to the replication fork. CAF-1 and Rtt106 load newly 
synthesized (light purple) histones to establish chromatin behind the fork. Previously 
loaded histones (dark purple) are also deposited on both daughter DNA strands. The 
histone chaperones involved in these processes are associated with replisome proteins: 
CAF-1/Rtt106 with PCNA and FACT/Asf1 with MCMs. 

Another histone chaperone associating with the replisome is Asf1, which interacts with MCM in a 
manner dependent on histone H3-H4 dimers [144]. It is hypothesized that Asf1 can pass newly 
synthesized H3-H4 dimers to deposition factors behind the replication fork [145-147]. This activity 
makes histone H3-H4 dimers available at the site of histone deposition just after replication (Figure 5). 
A closely related problem for the replication fork in the context of chromatin is the deposition of 
histones on nascent DNA. Chromatin formation proteins deposit histones onto both newly replicated 
DNA strands to form chromatin [148]. The heterotrimeric chaperone chromatin assembly factor 1 
(CAF-1) is involved in this mechanism [149,150]. CAF-1 contains a PCNA-binding motif called the 
PIP-box, allowing CAF-1 to associate with the replisome through PCNA and to deposit histone H3-H4 
dimers onto newly synthesized DNA [151,152] (Figure 5). The Rtt106 chaperone also participates in 
this process, associating with CAF-1 and H3-H4 dimers during chromatin formation [153,154]. These 
processes load newly synthesized histones onto DNA. How the existing chromatin marks are 
transferred from parental to daughter strands has not been characterized. Daughter strands contain H3-H4 
from parental DNA, but the mechanism of histone transfer is a continuing field of study [155]. 

At this point, nucleosomes form by the association of histone H2A-H2B with the deposited histone 
H3-H4. This process may occur through the FACT complex, since it is associated with the replisome 
and binds free H2A-H2B, or possibly through another histone H2A-H2B chaperone, Nap1 [156]. Electron 
microscopy studies show that this occurs very quickly, as nucleosomes can be observed forming just a 
few hundred base pairs after the replication fork [157]. Therefore, the entire process of forming new 
nucleosomes takes place just after replication due to the coupling of histone chaperones to the replisome. 

4. Challenges to Replication 

The varied chromatin landscape within the eukaryotic genome requires a "toolkit" of proteins that 
facilitate replisome progression through a variety of genomic regions that are difficult to replicate. 
Defects in this mechanism result in replication fork collapse and lead to genomic instability. In this 
section, analysis will focus on a subset of challenging genomic regions for DNA replication (Figure 6). 
It has become apparent that specialized mechanisms have evolved for the replisome to replicate DNA 
through these regions. 

Figure 6. Replication Fork Barriers (RFBs) control replication fork progression to protect genomic 
integrity. RFBs allow for the coordination of DNA replication with important processes on 
chromatin. (A) In budding yeast, Fob1 blocks replication fork progression in the opposing 
direction to 35S ribosome transcription. (B) In fission yeast, collisions between the 
transcription machinery and the replication fork are prevented by Reb1 and Sap1, which 
bind to the RFBs and block replisome progression near the 3' end of the 35S ribosome 
genes. (C) At the mating-type (mat1) locus in fission yeast, Rtf1 bound to the RTS1 site 
prevents replication fork progression at the 5' side of mat1. Only the fork in the 3' direction 
can progress through the region. This fork pauses at MPS1 and generates an imprint, 
probably a DNA strand discontinuity, which is required for a specific recombination event 
that allows for mating-type switching. 

4.1. Replication Fork Barriers 

In prokaryotes, such as the Escherichia coli bacterium, bidirectional replication initiates at a single 
replication origin on the circular chromosome and terminates at a site approximately opposite the 
origin [158]. This replication terminator region contains DNA sequences known as Ter sites, polar 
replication terminators that are bound by the Tus protein. The Ter-Tus complex counteracts helicase 
activity, resulting in replication termination [159]. In this way, prokaryotic replication forks pause and 
terminate in a predictable manner during each round of DNA replication. 

In eukaryotes, the situation differs. Replication termination typically occurs by the collision of two 
replication forks anywhere between two active replication origins. The location of the collision can 
vary based on the replication rate of each of the forks and the timing of origin firing. Often, if a 
replication fork is stalled or collapsed at a specific site, replication of the site can be rescued when a 
replisome traveling in the opposite direction completes copying the region. However, there are 
numerous programmed replication fork barriers (RFBs) and replication "challenges" throughout the 
genome. To efficiently terminate or pause replication forks, some fork barriers are bound by RFB 
proteins in a manner analogous to E. coli Tus [158]. In these circumstances, the replisome and the RFB 
proteins must specifically interact to stop replication fork progression. 
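
The dependence of the termination site on fork rates and origin timing, described above for forks that meet between two origins, can be made concrete with a short calculation. The Python sketch below is a simplified model that assumes constant fork speeds and no barriers or stalling; the function name, units, and example values are hypothetical and chosen only for illustration.

```python
def fork_meeting_point(x1, x2, t1, t2, v1, v2):
    """Where and when two converging forks meet.

    x1 < x2 are origin positions (kb); t1 and t2 are firing times (min);
    v1 is the speed of the rightward fork from origin 1 and v2 the speed
    of the leftward fork from origin 2 (kb/min).
    Returns (position_kb, time_min).
    """
    if not x1 < x2:
        raise ValueError("expected x1 < x2")
    if v1 <= 0 or v2 <= 0:
        raise ValueError("fork speeds must be positive")
    # Solve x1 + v1 * (t - t1) == x2 - v2 * (t - t2) for t.
    t_meet = (x2 - x1 + v1 * t1 + v2 * t2) / (v1 + v2)
    if t_meet < max(t1, t2):
        # One fork would reach the other origin before that origin fires.
        raise ValueError("one origin is passively replicated before firing")
    return x1 + v1 * (t_meet - t1), t_meet


# Two origins 100 kb apart, equal speeds, simultaneous firing.
print(fork_meeting_point(0, 100, 0, 0, 2, 2))
# Same origins, but the right-hand origin fires 10 min later.
print(fork_meeting_point(0, 100, 0, 10, 2, 2))
```

In this model, delaying one origin or slowing one fork shifts the meeting point toward that side, which is consistent with the variable termination positions described for eukaryotes. A programmed barrier would instead fix the position at which one of the two forks stops.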

4.2. Fork Barriers at rDNA Arrays in Eukaryotic Cells 

Certain RFB proteins function to prevent collisions between replication forks and the RNA 
transcription machinery. One of the best characterized RFBs is situated near the 3' end of the 35S 
ribosomal RNA gene (rDNA) in budding yeast [160]. Budding yeast chromosome XII contains an 
array of approximately 150 rDNA repeats, which are highly transcribed in order to produce the 
ribosomes necessary for translation. Therefore, transcription forks are exceedingly common in rDNA 
regions, which can lead to head-on collisions between replication and transcription forks, possibly 
leading to collapse of one or both forks. To prevent these collisions, replisome progression opposing 
the direction of ribosome transcription is inhibited at rDNA repeats, thus allowing for replication and 
transcription to proceed in the same direction through the rDNA loci [161]. How is this polar 
replication block achieved? 

In budding yeast, the Fob1 protein is responsible for the rDNA barrier (Figure 6A). Fob1 binds 
RFBs located at the 3' non-transcribed region of each rDNA gene [162]. Fob1 operates by looping or 
wrapping the DNA around itself to dramatically alter the local chromatin state and block replication 
from the 3' direction [163]. This Fob1-dependent chromatin structural change not only causes polar 
replication fork pausing but also leads to increased recombination at the RFB region [164]. Additional 
factors such as Tof1 and Csm3, components of the FPC, are required to ensure stable fork pausing at 
rDNA loci. Tof1 and Csm3 appear to work together with Fob1 to counteract Rrm3, a DNA helicase 
required for replication through the rDNA RFB regions. It is proposed that Rrm3 displaces Fob1 to 
facilitate fork progression [165]. In addition, Rrm3 is thought to be required for replication restart in 
other genomic loci requiring displacement of nonhistone proteins from DNA [166-170]. Interestingly, 
Rrm3 translocates with the replication fork and physically interacts with PCNA [171,172]. Therefore, 
similar protein-protein interactions between RFB proteins and Rrm3 may allow for polar passage of 
replication forks through multiple fork block sites throughout the genome. 

Although Fob1 is a budding yeast-specific protein, similar 3'-specific replication termination occurs 
in the rDNA array in fission yeast and mammalian cells. The fission yeast rDNA unit has four RFBs 
(Ter1-4), three of which are known to use RFB proteins and the FPC to achieve polar fork arrest 
(Figure 6B). The DNA binding protein Sap1, which also recognizes the mating-type locus, causes 
polar fork arrest at Ter1, while fork arrest at Ter2 and Ter3 is regulated by another RFB protein, Reb1, 
which is homologous to mammalian transcription terminator factor 1 (TTF-1) [173-176]. Human 
TTF-1 also localizes to the rDNA RFB. TTF-1 performs "double duty" at the RFB: TTF-1 promotes 
polar replication fork arrest and also mediates transcription termination at the rDNA, preventing rDNA 
transcription machinery from running past the RFB [177,178]. Interestingly, fission yeast Reb1 and 
mammalian TTF-1 share the Myb-like DNA-binding domain, which is critical for DNA binding and 
protein function [179]. These proteins impose polar arrest of replication forks, 
coordinating DNA replication with transcription to minimize fork collisions and genome instability. 

4.3. Replication Termination at the Fission Yeast Mating-Type Locus 

In addition to site-specific fork pausing required to prevent collision between replication and 
transcription machinery, fork pausing also allows for programmed cellular events that are coordinated 
with replication of specific genomic loci. For example, the fission yeast genome contains two 
genetically programmed fork pausing sites near the mating-type (mat1) locus: the mat1 pausing site 1 
(MPS1) and replication termination site 1 (RTS1). A strong polar fork arrest at RTS1 blocks one 
replication fork moving into the mat1 locus, allowing only the opposing fork to migrate into the mat1 
locus. This fork pauses at MPS1, generating an imprint that initiates a replication-coupled 
recombination event, leading to mating-type switching (Figure 6C) [180-185]. 

The RTS1-dependent fork arrest is DNA sequence-specific, because inserting the RTS1 sequence 
into other genomic loci in Schizosaccharomyces pombe also results in polar fork arrest and increased 
recombination at those sites [186,187]. The RTS1 DNA sequence was used to identify a site-specific 
replication terminator factor, Rtf1. Rtf1 contains a Myb-like DNA-binding domain and directly binds 
DNA to block replisome progression. Rtf1 mutations disrupt polar fork arrest, indicating that Rtf1 is 
required for unidirectional replication termination at the RTS1 locus [188,189]. Interestingly, efficient 
pausing at RTS1 also requires the presence of specific replisome proteins. The Pol α gene (swi7) in 
S. pombe is required to initiate recombination at the mat1 locus. Additionally, the swi1 and swi3 genes, 
encoding the fission yeast FPC subunits, are required for both MPS1 pausing and RTS1 fork 
arrest [190,191]. The requirement for both replication termination factors on the DNA and replisome 
components suggests that these proteins may interact to regulate replication pausing or termination at 
the mat1 locus. Together, these RFB sites provide insights into the role of chromatin-bound proteins in 
regulating replisome progression at specific loci. 

4.4. Replication Slow Zones 

Examining DNA replication on a global level, it appears that some loci are more sensitive to DNA 
replication stress than others. Analysis of chromosomal DNA using two-dimensional agarose gel 
electrophoresis showed that checkpoint mutants accumulate replication forks at specific genomic 
regions [114]. This led to the suggestion that certain genomic regions take longer to replicate because 
they are more difficult to replicate when compared with the rest of the genome. The replication 
fork-enriched loci were termed replication slow zones (RSZs). DNA replication through RSZs requires 
the checkpoint protein Mec1 (the budding yeast ATR homologue), implying that checkpoint pathways 
play an important role during replication of RSZs [114]. This is consistent with the observation that 
functional checkpoint proteins are generally required to replicate DNA during stress [192]. Early 
hypotheses proposed that RSZs were advantageous for general replication, allowing for the replisome 
to coordinate with other processes (e.g., slowing replication to enhance chromatid pairing or 
nucleosome loading) [114]. Recent work has suggested that mec1 mutant cells fail to replicate through 
RSZs due to a partial inability to upregulate dNTP levels (relieving inhibition of ribonucleotide 
reductase II by Sml1). This paucity of dNTPs leads to replication fork collapse in RSZs, because 
replication of RSZs requires higher local levels of dNTPs [193,194]. Therefore, the primary function 
of Mec1 may be to act as a sensor at the replication fork, upregulating nucleotide synthesis when there 
is a demand for increased dNTP levels during DNA replication. Considering that the checkpoint is 
required even in the absence of exogenous replication stressors, something inherent to RSZs causes 
replication stress. 

As described above, DNA replication through these challenging regions requires checkpoint proteins 
in budding yeast. This is consistent with replication in the presence of exogenous genotoxic agents 
such as methyl methanesulphonate (MMS: an alkylating agent, which arrests forks) or hydroxyurea 
(HU: a ribonucleotide reductase II inhibitor, which depletes cellular dNTP pools and arrests forks). In 
the absence of Rad53 checkpoint kinase, budding yeast cells accumulate unusual replication fork 
structures in response to MMS or HU; thus cells are unable to complete replication [195,196]. In 
contrast, wild-type cells are eventually able to complete replication in the presence of genotoxic 
agents, albeit slowly and with continued activation of repair and checkpoint pathways. 

In MMS-treated cells, DNA replication stress is induced globally, and replication is generally 
slowed by the checkpoints. In the RSZ model, the stress is induced by replication of the specific 
genomic region. The replisome encounters difficulties at these regions, resulting in fork arrest and 
generation of DNA structures that are recognized by checkpoint proteins. Whether the source of 
replication stress is exogenous or endogenous, checkpoint proteins recognize stalled forks, upregulate 
nucleotide synthesis, activate repair pathways, and arrest the cell cycle. By associating with the 
replisome, checkpoint kinases are well positioned to send signals to cope with fork arrest in response 
to a multitude of replication stressors, including endogenous obstacles on DNA sequences and 
exogenous genotoxic agents (Figure 7). 

Figure 7. Replication stress is induced by multiple factors in vivo. Some stresses are 
relatively rare events such as DNA damage, while others must be encountered at the same 
genomic sites every S phase, such as difficult-to-replicate regions. 

4.5. Fragile Sites 

Fragile sites are chromosomal loci at which recurrent breaks and gaps can be visualized on metaphase 
chromosomes. They were initially characterized as DNA breakage sites that recur in families, are 
inherited in a manner predicted by Mendelian genetics, and map to specific chromosome arm 
regions [197]. Further 
investigations have connected the occurrence of fragile sites with human diseases such as fragile X 
syndrome, the most common heritable form of mental retardation [198]. In cultured cells from normal 
individuals, these sites are relatively stable, and chromosome breakage rarely occurs. Therefore, these 
sites are known as "rare" fragile sites in normal cells. However, breakage at these sites can be induced 
by depleting folate or by adding excess thymidine in cultured human cell lines [199-201]. Because 
excess thymidine induces DNA replication stress, these findings suggest that DNA breaks at rare 
fragile sites are induced by DNA replication problems. Such expression of "fragile sites" (i.e., 
breakage at rare fragile sites) often occurs in chromosome regions associated with nucleotide repeat 
expansion. Because nucleotide repeats are considered to be poor templates for DNA polymerases and 
can induce replication fork stalling, breakage at rare fragile sites is likely to occur during DNA 
replication [202-204]. 

By treating normal cells with various replication inhibitors, investigators identified additional 
breakage sites, known as "common" fragile sites (reviewed in [205]). Low doses of aphidicolin (a 
Pol α inhibitor) lead to consistently observed DNA breaks at over 70 genomic sites [206]. Other 
studies have reported additional fragile sites using bulky nucleotide analogues, such as 5-azacytidine 
or bromodeoxyuridine [207]. These sites are less well characterized than aphidicolin-dependent fragile 
sites, but both types of fragile sites are found in AT-rich genomic loci. Thus, chromosome breakage at 
specific loci is often due to replication perturbations, which are generally provoked by inherent 
features associated with the specific chromosomal loci. 

Fragile sites appear to share common features that may help to understand the mechanisms of 
chromosome breakage. Breakage rates at aphidicolin fragile sites are dependent on drug concentrations, 
and these sites tend to form secondary structures and replicate late in S phase [208-210]. Because 
common fragile sites occur in AT-rich repeat regions, it is possible that the properties of AT-rich DNA 
hinder proper replication processes. For example, AT-repeat-rich microsatellites contain sequences that 
are structurally flexible with regard to base pairing and can form hairpin secondary structures [211]. 
Perhaps during DNA replication, while DNA is unwound and ssDNA is exposed at fragile sites, the 
flexible DNA sequences allow for the formation of hairpin structures, thus preventing replication fork 
progression and in some cases leading to DNA breaks. Interestingly, all fragile sites (including 
common and rare) share a late-replicating phenotype [205]. It is possible that late S-phase replication 
of fragile sites allows for proper DNA replication of difficult templates. Conversely, it is also 
possible that cells delay replication of fragile sites owing to the inherent difficulty of replicating the 
region. As a consequence, these fragile sites may break due to diminished dNTP pools at the end of S 
phase, thus requiring checkpoint proteins to complete the region. 
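
Because AT-richness recurs as a feature of the fragile sites discussed above, a simple way to look for candidate AT-rich stretches in a sequence is a sliding-window AT fraction. The Python sketch below is a minimal illustration only; the function name, window size, and toy sequence are hypothetical, and AT content by itself is not sufficient to identify a fragile site.

```python
def at_fraction_windows(dna_seq, window, step):
    """Return (start, AT fraction) for each full window along the sequence.

    start is a 0-based offset; windows that would run past the end are skipped.
    """
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    seq = dna_seq.upper()
    results = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        at_count = chunk.count("A") + chunk.count("T")
        results.append((start, at_count / window))
    return results


# Toy sequence (hypothetical): an AT repeat followed by a GC repeat.
toy_dna = "ATATATATGCGCGCGC"
print(at_fraction_windows(toy_dna, window=8, step=4))
```

In practice, such a profile would be only one of several features to consider alongside replication timing and chromatin context, which the text identifies as additional contributors to fragility.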

Other factors may play a role in the breakage of fragile sites. When a fragile region sequence was 
deleted from human/mouse hybrid cells, the rate of DNA breakage at the site was reduced 2- to 
12-fold [212]. However, DNA breakage at the site was not entirely abrogated, implying that chromatin 
context, not simply DNA sequence, also contributes to breakage propensity. Additionally, ATR has 
been shown to be required for replication through these regions, suggesting intimate regulation of 
DNA replication through these regions by the cell cycle checkpoint [213]. These studies indicate that a 
combination of DNA sequence, chromatin state, and the replication timing of a specific locus contributes 
to the "difficulty" of replication in the fragile sites. Global mapping and characterizations of these sites 
will reveal how various features of fragile sites contribute to their challenging replication phenotype. 

5. Conclusions and Closing Remarks 

The study of eukaryotic DNA replication has continued to expand over the past several decades and 
will likely continue to do so, filling the knowledge gaps in the regulation of replication processes. 
Many of the recent advances in the field have focused on how specific DNA sequences, structures, and 
regions are specifically regulated. Further studies will elucidate how characterized and yet-unidentified 
replisome proteins contribute to replication processes in a site-specific manner. In our current 
understanding, the replisome and replisome-associated factors are able to respond to a variety of 
hindrances to successfully complete DNA replication (Figure 7). The above review of challenging loci 
for replication is by no means exhaustive, and new techniques will likely identify even more genome 
regions that require specialized mechanisms for successful and efficient replication. Genome-wide 
approaches will allow us to understand the genomic regions and features affected by perturbation of 
specific replisome components. Current developments in single-molecule studies will open 
opportunities to test mechanistic models of replication in a locus-specific manner. The next few years 
should see great advances in our understanding of replication regulation at the global and locus level. 
Future studies will uncover the mechanisms by which checkpoint proteins and replisome-interacting 
factors cooperate to ensure replisome progression through difficult-to-replicate genomic regions. 